User-directed Exploration of Mining Space with Multiple Attributes
نویسندگان
چکیده
There has been a growing interest in mining frequent itemsets in relational data with multiple attributes. A key step in this approach is to select a set of attributes that group data into transactions and a separate set of attributes that labels data into items. Unsupervised and unrestricted mining, however, is stymied by the combinatorial complexity and the quantity of patterns as the number of attributes grows. In this paper, we focus on leveraging the semantics of the underlying data for mining frequent itemsets. For instance, there are usually taxonomies in the data schema and functional dependencies among the attributes. Domain knowledge and user preferences often have the potential to significantly reduce the exponentially growing mining space. These observations motivate the design of a userdirected data mining framework that allows such domain knowledge to guide the mining process and control the mining strategy. We show examples of tremendous reduction in computation by using domain knowledge in mining relational data with multiple attributes.
منابع مشابه
Amplitude versus Offset (AVO) Technique for Light Hydrocarbon Exploration: A Case Study
AVO as a known methodology is used to identify fluid type and reservoir lithology in subsurface exploration. Method discussed in this paper, consists of three stages, including: Direct modeling, Inverse modeling and Cross plot interpretation. By direct modeling we can clarify lithology or fluid dependent attributes. Analysis performed using both P-P and P-Sv attributes. Inverse modeling deals w...
متن کاملTowards a Framework for Semantic Exploration of Frequent Patterns
Mining frequent patterns is an essential task in discovering hidden correlations in datasets. Although frequent patterns unveil valuable information, there are some challenges which limits their usability. First, the number of possible patterns is often very large which hinders their effective exploration. Second, patterns with many items are hard to read and the analyst may be unable to unders...
متن کاملGeo-visualization Support for Multidimensional Clustering
In this paper we consider how multidimensional clustering can be complemented by interactive visualization. We propose a link between geovisualization and data mining systems for supporting an iterative analysis cycle, including data pre-processing and visual exploration, automatic detection of clusters in multidimensional space of user-selected attributes, and visual analysis of cluster analys...
متن کاملInteractive Visualization of the Market Graph
Financial markets are a fruitful area for data exploration, but the overwhelming size and dimension of the datasets usually prohibit meaningful analysis, especially on a large scale. Thus, there is a need for effective visualization tools to assist in efficiently exploring the data space. In this paper, we present a novel visualization tool that empowers a user with an interactive tool for find...
متن کاملConnecting Segments for Visual Data Exploration and Interactive Mining of Decision Rules
Visualization has become an essential support throughout the KDD process in order to extract hidden information from huge amount of data. Visual data exploration techniques provide the user with graphic views or metaphors that represent potential patterns and data relationships. However, an only image does not always convey high–dimensional data properties successfully. From such data sets, vis...
متن کامل